Key Points 


The No Child Left Behind Act of 2001 (NCLB) had mixed effects on the nation’s educa- 


tional ecosystem. 


NCLB succeeded in shifting the focus from inputs to outcomes, shining a light on perfor- 
mance of different student subgroups, and using reporting requirements to spur devel- 
opment of more-robust education data systems. 


But NCLB included several missteos—namely, setting unrealistic goals and expecta- 
tions, narrowly focusing on reading and math results, and designing an accountability 
system with an inflexible one-size-fits-all approach. 


Future accountability legislation should leverage NCLB’s high points while charting a 


path that avoids the law’s bumpier portions. 


With No Child Left Behind (NCLB) as the entrance 
ramp, we’ve now been traveling down the highway 
of school accountability for two decades. Parts of 
this road have wound through interesting terrain 
and scenery, yielding previously unseen views of 
our education system. Other stretches have been 
much less pleasant, with bumps, detours, and 
wrong turns throwing us off course. 

Considering the state of the educational world of 
2001, it’s clear why policymakers across the political 
spectrum agreed to build this road. The combina- 
tion of standards, testing, and accountability 
promised to ensure every student in the United 
States was proficient in math and reading. But now 
that we’ve been on this road for 20 years and still 
haven’t reached this destination—or even one 
close—it’s a good time to stop, look back, and take 
stock of the productive parts of the journey and the 
portions we would like to forget. 

In surveying the land through which the school 
accountability highway winds, I identify three major 


AMERICAN ENTERPRISE INSTITUTE 


things we ultimately got right about accountability, 
even if some of these successes were something of 
a happy accident. I also nail down three things we 
got wrong, discussing the conditions that led to 
these missteps and detailing their consequences. 
In reflecting on the accountability era’s successes 
and failures, I aim to contribute, however slightly, 
to charting a productive path forward for account- 
ability. 


School Accountability’s Successes 


With today’s hyperpolarized politics, it can be diffi- 
cult to believe there was a time in our not-so-distant 
past when there was broad bipartisan support for a 
major piece of federal legislation. But that was exactly 
the case during NCLB’s enactment in January 2002. 
At the signing ceremony in Hamilton, Ohio, 
President George W. Bush touted accountability as 
the law’s driving principle and confidently claimed 
the legislation would put our nation’s schools on 


“a new path of reform, and a new path of results.” 
Referring to the future of our nation, Sen. Edward 
Kennedy (D-MA) proclaimed, “No piece of legisla- 
tion will have a greater impact or influence on 
that.”* Reps. John Boehner (R-OH) and George 
Miller (D-CA), then the chair and ranking member, 
respectively, of the House Committee on Educa- 
tion and the Workforce, were similarly enthusias- 
tic about the future of our country’s public schools. 
This enthusiasm was even embedded in the legis- 
lation, which aimed to make every single student in 
the country proficient in reading and math by 2014. 

Although NCLB didn’t produce the educational 
nirvana politicians promised, the legislation’s account- 
ability aspects delivered some very real positives to 
the country’s educational ecosystem. They shifted 
the focus from inputs to outcomes, shined a light 
on performance of different student subgroups, 
and used reporting requirements to spur the devel- 
opment of data systems that have facilitated insights 
into our education system’s operations and effects. 


Shifting the Focus from Inputs to Outcomes. The 
federal government’s involvement in public educa- 
tion began in earnest with President Lyndon John- 
son’s 1965 signing of the Elementary and Second- 
ary Education Act. Spurred by concerns over edu- 
cational inequity, Title I of the legislation sent 
money to districts—via state departments of edu- 
cation—to spend on initiatives to improve the 
quality of education available to disadvantaged 
students. With only nominal reporting required 
to access Title I dollars, this initial federal foray 
into education began a long period in which dis- 
cussions of dollars dominated education policy 
debates. Expenditures were about the only aspect 
of education that were consistently measured, and 
thus drove policy discussions. 

This focus on inputs held sway throughout the 
1970s and into the 1980s. In the mid-1980s, though, 
the notion of educational outcomes began to creep 
into the policy realm. Most point to the National Com- 
mission on Excellence in Education’s 1983 report A 
Nation at Risk: The Imperative for Educational Reform as 
the effective starting point for that discussion. 
Following its hyperbolic portrayal of the state of 
American education, the report made some basic 
outcome-oriented policy recommendations targeted 
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at states, including adopting challenging standards 
and more-rigorous graduation requirements. 

The rest of the 1980s and the 1990s saw a series 
of initiatives—the National Education Summit, 
Goals 2000, and the Improving America’s Schools 
Act (IASA)—encouraging states to adopt stand- 
ards and administer aligned assessments that 
would, theoretically, provide insight into student 
outcomes. States were slow on the uptake, though, 
and at the turn of the century, many states still 
lacked the infrastructure required to generate 
meaningful outcome data. For example, Martin 
Carnoy and Susanna Loeb’s comprehensive cata- 
log of states’ testing requirements and accounta- 
bility systems shows that in the 1999-2000 school 
year, lowa and Nebraska had no state testing require- 
ments, while other states such as Arkansas, Colo- 
rado, and Oklahoma required testing in only one or 
two grades.4 And although states such as Florida, 
North Carolina, and Texas were early adopters of 
strong school accountability, more than one-fifth 
of states had no accountability policy at the turn of 
the century. 

NCLB changed that. By threatening to withhold 
Title I dollars, the legislation effectively ensured 
that by the 2005-06 school year, states would 
adopt rigorous standards in reading and math, test 
students with assessments aligned to those stand- 
ards, and begin holding schools accountable for the 
results. In doing so, NCLB flipped the script and 
pushed student outcomes to the forefront of policy 
debates. And although the focus on student out- 
comes hasn’t been without drawbacks, on balance 
it has been a positive development. 

Instead of focusing exclusively on inputs—using 
dollars as a proxy for quality—we are now much 
more likely to start policy discussions by asking 
how well schools are serving students. Inputs are 
clearly part of that discussion, but it has been pro- 
ductive for the policy conversation to progress to 
viewing inputs as a means to the end of improved 
student outcomes rather than as ends themselves. 


Shining a Light on Different Student Subgroups. 
Achievement gaps dominate the contemporary 
education landscape. Everyone with even a passing 
familiarity of the topography is aware of disparities 
by race, ethnicity, socioeconomic status, English- 
learner classification, disability status, and other 


characteristics. With such broad awareness of these 
disparities, it’s easy to assume that we’ve always had 
our finger on the pulse of subgroup performance. 
That’s far from the case, though. 

The National Assessment of Educational Pro- 
gress (NAEP) has disaggregated results for some 
student subgroups since the early 1990s. However, 
NAEP lacked universal state participation until 
2003; the subgroup disaggregation was done only 
for the national sample and states that elected to 
participate. And NAEP’s lack of alignment to any 
common set of state standards further compli- 
cated interpretation. IASA also made an effort at 
disaggregation, requiring states to design their read- 
ing and math assessments in a manner that enabled 
subgroup reporting. But states’ blasé approach to 
compliance with the law—particularly their slow- 
walking of the required assessments—led to hap- 
hazard reporting of results for different student 
groups. 

As was the case in shifting the focus from inputs 
to outcomes, NCLB was the inflection point for 
shining a light on subgroup performance. The law 
required detailed outcome reporting down to the 
school level for a wide range of student groups includ- 
ing those delineated by race or ethnicity, socioeco- 
nomic status, English-learner classification, and 
disability status. And the legislation’s accountabil- 
ity provisions stated that if even a single subgroup 
of students in a school failed to meet the law’s def- 
inition of adequate yearly progress (AYP), then the 
whole school would be classified as failing to make 
AYP. Such a blunt policy design will have down- 
sides—we’ll get to those below—but it directed 
immediate attention to the performance of differ- 
ent student groups. 

Over time, this attention led to a clear-eyed 
understanding of the states, districts, and schools 
in which different student groups were and were 
not scoring well on reading and math tests. This 
information allowed for a better grasp of the sets 
of students needing additional attention, more 
thoughtful consideration of how to help, and the 
opportunity to detect any indications of success 
for various improvement efforts. More generally, 
NCLB’s accountability provisions illuminated the 
outcomes of different student subgroups, which 
moved us from an “ignorance is bliss” world to one 
in which we had to consider that our schools served 
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some groups of students much better than it did 
others. Such illuminations hit particularly hard in 
states across the Upper Midwest and the North- 
east—such as Connecticut, Minnesota, and Wis- 
consin—that have long prided themselves on the 
quality of their schools but where subgroup dis- 
ageregation revealed some of the largest race- 
based achievement gaps. 


Developing Data Systems. Accountability requires 
data. NCLB’s accountability provisions required 
data on test scores, attendance, racial and ethnic 
group identification, sex, free- and reduced-price- 
lunch eligibility, disability status, and English- 
learner classification, among other data elements. 
And the provisions required all this information 
annually, not just once. At NCLB’s enactment, a 
significant number of districts, and even some states, 
lacked the technological infrastructure required to 
collect, maintain, and report these data. 

Given the Bill Clinton administration’s lax enforce- 
ment of IASA’s testing and reporting provisions, 
states seemed to initially expect the Bush admin- 
istration to take a similar approach with NCLB, 
considering the law’s aggressive timeline as a mere 
suggestion. But it quickly became apparent that 
wouldn’t be the case, with Secretary of Education 
Rod Paige making clear that waivers were off the 
table. This spurred states to kick implementation 
efforts into high gear, and states generally had 
their testing and accountability systems up and 
running by the 2005-06 school year—the original 
timeline specified by NCLB. 

States weren’t left completely high and dry in 
developing their data systems. Just after the pas- 
sage of NCLB, President Bush signed the Educa- 
tional Technical Assistance Act of 2002, which 
authorized the Statewide Longitudinal Data Sys- 
tems (SLDS) Grant Program. SLDS grants pro- 
vided states with up to $20 million per award— 
states could apply multiple times—to help estab- 
lish their data systems. States varied in how they 
leveraged these dollars, with states such as Califor- 
nia and Oklahoma doing the bare minimum and 
others such as Florida, North Carolina, and Ohio 
developing incredibly impressive data systems that 
link K-12 records with information on postsecond- 
ary education, the labor market, public assistance 


programs, the criminal justice system, and other 
societal institutions. 

These data systems have yielded a stunning num- 
ber of insights into our education system. We have 
a detailed understanding of achievement gaps— 
and disparities in other outcomes—across a range 
of demographic groups. We’ve come to more fully 
appreciate the importance of teachers and how 
their impact persists long after students leave their 
classrooms. We know so much about the effects of 
accountability policies, intended and otherwise. 
We can detail the ins and outs of different school 
choice programs such as charter schools, vouchers, 
and open enrollment. We’ve painted nuanced por- 
traits of student mobility and attendance patterns. 
In short, we’ve accumulated an incredible amount 
of knowledge about our nation’s public education 
system. And it all started with states working to 
meet NCLB’s testing and accountability provi- 
sions. 


NCLB’s Benefits. Although accountability didn’t 
propel our education system to the heights that 
politicians promised, it delivered real benefits. Our 
policy discussions now revolve around student 
outcomes rather than educational inputs. Data sys- 
tems have provided a basis for learning so much 
about our education system, including the perfor- 
mance of different student subgroups. 


Although accountability didn’t pro- 
pel our education system to the 
heights that politicians promised, tt 
delivered real benefits. 


Interestingly, with the partial exception of sub- 
group reporting, the accountability movement’s 
positive legacies went largely unmentioned at its 
outset. Politicians didn’t tout the data systems that 
the law would create. They didn’t hype the shift in 
policy discourse. No, they told us that holding our 
schools accountable would increase their quality 
to such a degree that all students—every single 
one—would be proficient in reading and math in 
just more than a decade. Juxtaposing the rhetoric 
of accountability with its reality hints at what the 


AMERICAN ENTERPRISE INSTITUTE 


federal government might be best suited to do in 
this realm. 


Where We Went Wrong 


Accountability clearly failed to deliver on the high- 
flying promises politicians made in the halcyon 
days of the early 2000s. But why? What went 
wrong? Twenty years down the road, we can address 
these questions. 

Three major mistakes were made at the outset 
of the accountability movement: setting unrealis- 
tic goals and expectations, narrowly focusing on 
reading and math results, and designing NCLB’s 
accountability system with an inflexible, one-size- 
fits-all approach. In hindsight, we can see how these 
three early missteps conspired to render accounta- 
bility, or at least NCLB’s version of it, dead in the 
water. 


Setting Unrealistic Goals and Expectations. It’s 
difficult to overstate the degree of rhetorical excess 
politicians employed when arguing for accountabil- 
ity. President Bush said strong accountability 
would “make sure not one single child gets left 
behind in America.”> Sen. Jon Corzine (D-NJ) pro- 
fessed optimism that strong accountability measures 
“will help narrow the educational achievement gaps 
that threaten every child’s access to the American 
dream.” At the time, politicians were likely aware 
of their hyperbole but thought it benign. 

Looking back, it’s conceivable that NCLB could 
have survived politicians’ aspirational rhetoric 
and impossible promises. Even in 2001, the public 
expected a degree of exaggeration and bombast 
from their elected representatives. But baking the 
unrealistic goals and expectations into the law— 
even going so far as to design the entire accounta- 
bility system around them—was bad policymaking 
and is ultimately what ruined NCLB’s public sup- 
port. 

Setting a goal of universal reading and math 
proficiency by 2014 effectively ensured we would 
end up judging accountability, and NCLB more 
broadly, as a failure. Even at the time, it was clear 
we were never going to achieve that goal or come 
close to it.” And because the law required states to 
specify a trajectory toward universal proficiency, 


the failure judgments wouldn’t be put off until 
2014. They would begin to trickle in much earlier. 

Secretary of Education Arne Duncan told Con- 
gress in spring 2011 that without waivers, more 
than 80 percent of schools would fail AYP that 
year. And as they did, they provided detractors with 
ammunition to chip away at accountability.’ And 
chip away they did. The annual PDK/Gallup poll on 
education found that the percentage of Americans 
viewing NCLB unfavorably nearly doubled between 
2005 and 2010, growing from 27 to 46 percent.? 

The primary danger of using an unattainable 
goal as the basis for stamping failing labels on a 
broad range of schools stems from the disconnect 
between those labels and parents’ evaluations of 
their children’s schools. Polling routinely shows 
majorities of parents grade their children’s schools as 
an A or a B, judgments typically formed by up-close 
observation of their children’s daily experiences.’° 
So when a policy comes along and says that their 
children’s school is failing, most parents will trust 
their own judgment more than an opaquely gener- 
ated label. And as these discrepancies accumulate, 
it will ultimately be the policy’s validity that par- 
ents call into question, not their judgment. 

Going forward, policymakers should specify incre- 
mental, achievable goals for their school improve- 
ment efforts. Of course, goals like this don’t pro- 
vide sound bites like promises of universal profi- 
ciency do. But reality doesn’t always fit into nice, 
neat sound bites. 


Narrowly Focusing on Reading and Math Test 
Results. We all want kids to read well and do math 
at a high level. And we should, as those skills are 
instrumental for navigating the world. But educa- 
tion is much more than that. It should prepare chil- 
dren for membership in a democratic polity. It 
should develop social and emotional skills that will 
be useful in all aspects of life. It should consistently 
expose students to a range of ideas, subjects, and 
extracurriculars so they can figure out what draws 
them in and what repels them. NCLB’s accountability 
system didn’t explicitly say education shouldn’t do 
these things, but by basing its ratings almost entirely 
on students’ performance on math and reading 
tests, it signaled what it valued. 

Educators and administrators responded to 
these signals, going to great lengths to boost math 
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and reading scores in their schools. Many schools 
devoted full weeks to test preparation strategies 
leading up to the annual state assessments. Dis- 
tricts often increased instructional time in reading 
and math at the expense of non-tested subjects 
such as art, music, and social studies.” In the most 
egregious cases, educators cheated on the tests to 
satisfy the seemingly bottomless appetite of NCLB’s 
accountability system.” 

In short, NCLB’s accountability system threw 
the educational ecosystem out of balance and incen- 
tivized counterproductive behaviors to game the 
system. And, adding insult to injury, all the time, 
effort, and dollars devoted to chasing universal 
proficiency didn’t have much effect. Achievement 
levels today are only slightly higher than they were 
20 years ago. We still see enormous gaps between 
advantaged students and their less advantaged peers. 
Well-designed accountability systems shouldn’t gen- 
erate these sorts of outcomes. 

The accountability movement’s excesses are per- 
haps best illustrated by the nature of the follow-on 
reform du jour, social and emotional learning (SEL). 
SEL explicitly based its appeal on educating the 
whole child, a not-so-subtle jab at the intense read- 
ing and math focus that characterized the account- 
ability era. And the enthusiasm with which folks 
across the educational landscape—liberals and 
conservatives, parents and teachers, funders and 
fundees—responded to this appeal illustrates how 
eager stakeholders were to move on from a policy 
conversation dominated by test-based accounta- 
bility. 


Federal Control Without Flexibility. Every con- 
sequential aspect of NCLB’s accountability sys- 
tem—testing requirements, AYP’s definition, the 
series of sanctions—was designed in DC and dic- 
tated to the states, which were given no meaningful 
freedom to adapt the system to their realities. And 
while this rigidity perhaps represented a rational 
response to states’ past foot-dragging, it resulted in 
a one-size-fits-all accountability system that was 
stunningly ill-suited to handle the variation that is a 
defining feature of American education. 

Further, the NCLB accountability system’s inflex- 
ible and punitive nature highlights its motivating 
assumption: Educators weren’t trying hard enough, 
and threatening to punish their schools would 


make them work harder and increase student 
learning. It’s a mistake to make policy based on 
assumptions that question educators’ motives and 
efforts. And restricting states and districts from 
adapting policy in a manner that moves away from 
such assumptions just compounded that mistake. 

A fascinating thought experiment involves con- 
trasting NCLB’s accountability system with what 
might have emerged without such strong federal 
intervention. At the time of NCLB’s 2001 signing, 
most states had some form of accountability sys- 
tem in place, but there was substantial variation in 
their designs. And it’s interesting to think what 
might have evolved had states been left on their 
own. Sure, some states wouldn’t have done any- 
thing meaningful around accountability, but oth- 
ers were seemingly on track to develop systems 
that would have avoided many of NCLB’s missteps. 
Systems in states such as Kentucky, Maryland, 
New Mexico, and North Carolina used test scores 
as the basis for providing monetary rewards. Other 
systems, including those in Connecticut and Mis- 
souri, used test results as an information point or 
indicator that particular schools or districts might 
need additional scrutiny. 


Going forward, policymakers should 
continue the march back toward 
increased state control of school 
accountability systems. 


NCLB immediately squashed this variation, 
instead effectively taking the harshest accountabil- 
ity system in the country, Texas’, and immediately 
implementing it everywhere. Given the direction 
accountability is headed in the post-NCLB years— 
less punitive, with greater state control—it is instruc- 
tive to look back and see that some states were 
ahead of their time. It’s also natural to ask whether 
we might have reached an accountability equilib- 
rium more quickly without NCLB. 

Going forward, policymakers should continue 
the march back toward increased state control of 
school accountability systems. In my view, the 
endgame is a set of systems in which states use dif- 
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ferent indicators—including test scores, gradua- 
tion rates, and absenteeism—to judge whether a 
school or system warrants a closer look. These 
deeper dives, which could be operated out of an 
office similar to a state auditor, could work to iden- 
tify the root causes of the concerning numbers. In 
something as complex as schooling, high-level 
numbers can tell you only so much, and it often 
takes setting foot inside a school to get a sense of 
what’s going well and what isn’t and to make pro- 
ductive recommendations about how the school 
might go about any improvement process. 


The Cost of These Missteps. These three major 
missteps helped turn the tide against accountabil- 
ity, putting a bad taste in the mouth of the public 
and leading to the scaled-down version in the 
Every Student Succeeds Act. Such a retreat could 
be healthy, giving accountability proponents time 
to reflect on what went wrong the first time and to 
work on developing more thoughtful and produc- 
tive approaches to ensuring school quality. But the 
long-term cost of these mistakes might be the pro- 
hibition of any idea with even a whiff of enhanced 
accountability from the policy arena. It’s possible 
that these mistakes so poisoned the well that any 
accountability-oriented reforms are off the table 
for the foreseeable future. 


Conclusion 


The accountability movement began with the best 
intentions, promising to pave a road to an educa- 
tional paradise where every single student in the 
country was proficient in reading and math. We’ve 
been on this road for 20 years and never arrived at 
this destination, instead ending up somewhere that 
doesn’t look dramatically different from where we 
started. 

The journey has certainly been interesting, though. 
Along the way, we’ve collected souvenirs we will 
long value—a focus on outcomes, impressive data 
systems, and a nuanced understanding of the out- 
comes of different student subgroups. But some 
wrong turns and detours sent us down desolate 
roads we’d like to forget. As we plan our next trip, 
we’d do well to leverage the high points of the last 
journey and chart a path that avoids the bumpier 
portions. 
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